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Abstract 



Oh 

C/3 , In order to improve precision and efficiency sharing analysis should track both Ireeness and 

linearity. The abstract unification algorithms for these combined domains are suboptimal, 
hence there is scope for improving precision. This paper proposes three optimisations for 
tracing sharing in combination with freeness and linearity. A novel connection between 
equations and sharing abstractions is used to establish correctness of these optimisations 
£Nj , even in the presence of rational trees. A method for pruning intermediate sharing ab- 

04 ■ stractions to improve efficiency is also proposed. The optimisations are lightweight and 

therefore some, if not all, of these optimisations will be of interest to the implementor. 
, Keywords: Abstract interpretation, sharing analysis, freeness, linearity and rational trees. 
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1 Introduction 

A set-sharing analyser will usually also track freeness and linearity. This is be- 
cause freeness and linearity are cheap to maintain and result in more accurate, 
that is smaller, sharing abstractions which in turn improve the efficiency of the 
sharing component of abstract unification. However, current abstract unification 
algorithms for sharing, freeness and linearity are suboptimal. This paper considers 
how to improve the precision of sharing with freeness and linearity by considering 
the interaction of these components. These refinements do not incur a significant 
computational overhead. To this end three optimisations are given, along with ex- 
amples of where precision is gained. Their cost is discussed and correctness proved. 

The first optimisation follows from the observation that the algorithms for pair- 
sharing with linearity can sometimes out perform set-sharing with linearity (in 
terms of which pairs of variables may share). This is because of an independence 



check which pervades the set-sharing literature (from early work (Langen, 1991) to 



the most recent and comprehensive (Bagnara et ai, 2000D ). This check is in fact 



redundant. By removing this, the precision of abstract unification is improved, since 
linearity can be exploited more frequently. 



File (File, 1994) observed that freeness can be used to decompose a sharing 
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abstraction into a set of sharing abstractions. For each component of the decompo- 
sition, the sharing groups of that component do not (definitely) arise from different 
computational paths. Abstract unification can then be applied to each component 
and the resulting abstractions merged. This tactic has not been included in analysers 
owing to its prohibitive cost. The second optimisation is a lightweight refinement 
of abstract unification inspired by the decomposition. Though not as precise as the 
full decomposition, it does achieve the necessary balance between cost and benefit. 

Thirdly, an optimisation for pruning sharing groups is presented. This tactic 
demonstrates that sharing in combination with freeness can improve groundncss 
which, in turn, can improve sharing (even in the presence of rational trees). Put 
another way, it means that any optimal algorithm for sharing, freeness and linearity 
will have to consider subtle interactions between sharing, freeness and groundness. 

One principle of set-sharing is that the number of sharing groups should be 
minimised. As well as increasing precision, this can improve efficiency and possibly 
avoid widening. A fourth technique is proposed which can prune the size of inputs to 
the abstract unification algorithm by considering the grounding behaviour of sets of 
equations. Reducing the size of the inputs (and intermediate abstractions) simplifies 
abstract unification and can thereby improve performance. Whilst the technique will 
not theoretically improve the precision of the overall result, in practice, a precision 
gain might be achieved if widening is avoided within the unification algorithm. 

Correctness is expressed in terms of a novel concretisation map which charac- 
terises equations as their idempotent most general unifiers. This simplifies the cor- 
rectness arguments and in particular enables the abstract unification algorithms to 
be proved correct for rational tree constraint solving (as adopted by SICStus Pro- 
log and Prolog-Ill). To the best of the authors' knowledge, this is the first proof of 
correctness for a sharing, freeness and linearity analysis in the presence of rational 
trees. (Previous work for rational tree unification has either focused on pair-sharing 
( King, 2000 ) or set-sharing without freeness and linearity ( Hill et a/., 2002 )). 

In summary, this paper provides the implementor with a number of low-cost 
techniques for improving the precision and efficiency of sharing analyses. 



2 Preliminaries 

2.1 Trees and terms 

Let e denote the empty sequence, . denote sequence concatenation, and ||a|| denote 
the length of a sequence a £ N*. A tree (or term) over an alphabet of symbols F is 
a partial map t : N* — > F such that t(a) = t if a = s, otherwise t(a) = ti((3) where 
a = i.(3 and t = f(h, . . .,t n ). Let T(F) and T°°(F) denote the set of finite and 
possibly infinite trees over F. Let U denote a (denumerable) universe of variables 
such that F f~l U — 0, and let var(t) = {u 6 U \ 3a G N*.t(a) = u} where 
t e U U). Finally, \S\ denotes the cardinality of the set S. 
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2.2 Substitutions and equations 

A substitution is a (total) map 9 : U -> T°°(F U U) such that dom(O) = 
{u G [/ | 6*(u) 7^ u} is finite. A substitution can be represented as a finite set 
{x i ► | a; S cfom(0)}. Let rng(9) = U{var(9(u)) \ u £ aWi(#)} and let 

Sub denote the set of substitutions. If 9 = {xi i— > ti}f =1 then #(i) denotes the 
tree obtained by simultaneously replacing each occurrence of Xi in t with ij. For 
brevity, let 0(a;, a) = t(a) where 9(x) — t. An equation e is a pair (s = t) where 
s, t £ T°°(F U Z7). A finite set of equations is denoted -E and -Eqn denotes the set 
of finite sets of equations. Also define 6(E) = {9(s) = 9(t) \ (s = t) £ £}. The map 
eqn : Sub — » -Egn is defined eqn(9) — {x — t | (x > t) £ 9}. Where Y C [7, projec- 
tion out and projection onto are respectively defined 3Y.9 — {xi~^t£9\x^ Y} 
and 3Y.9 = 3(U \ Y).9. Composition 9 o -0 of two substitutions is defined so that 
(9 o ip)(u) = 9(ip(u)) for all u £ U. Composition induces the (more general than) 
relation < defined by 6 < i/j iff there exists 6 £ Sub such that ip — 8 o 9. A renaming 
is a substitution p € Swfe that has an inverse, that is, there exists p~ x £ Sub such 
that p" 1 o p = id. The set of renamings is denoted Rename. A substitution 9 is 
idempotent iff 9 o # = 9, or equivalently, iff dom(9) n rng(9) = 0. 



2. 3 Solved forms and most general unifiers 

A substitution is in rational solved form iff it has no subset {xi i— > X2, . . ., a;„ i— > xi} 
where n > 2. The subset of Sufr in rational solved form is denoted RSub. The set 
of unifiers of E is defined by: unify(E) = {9 £ Sub \ V(s = t) G E.9(s) = 9(t)}. 
The set of most general unifiers (mgus) and the set of idempotent mgus (imgus) are 
defined: mgu(E) = {9 £ unify(E) \ VV> £ unify(E).9 < if)} and imgu(E) = {9 £ 
mgu(E) | dom(9) n rng(9) = 0}. Note that imgu(E) ^ iff mgu(E) ^ ( jLassez 
aL, 1988| ) . An mgu can be renamed to obtain any other (as can an imgu) . 



Lemma 2.1 (Proposition 11 from ( jLassez et al, 1988 )) 

Let 9 £ imgu(E). Then <fr £ imgu(E) iff there exists {xi i— ► yi}f—i Q 9 such that 
<j) = {xi^ yi,yi^ Xi}?=i ° 0. 

One way to obtain an imgu is by considering limits of substitutions. 
Definition 2.1 

Let {t n \ n £ N} £ T°°(F U U). Then t = )xai n ^ 00 t n iff for all k £ N there 
exists I £ N such that for all m > I and ||a|| < k, t(a) = t m (a). Furthermore, if 
{9 n | n £ N} C Sub then lim JWOO 0„ = Ax.limn^oo 9 n (x). 

Note that lim„^oo 6»" exists iff 9 £ RSub ( [King, 200Cj ). Henceforth 6»°° abbreviates 
lim n _>cx) & n ■ If 9 £ RSub then 9°° is idempotent whereas if 9 is idempotent then 
0°° = 9. The following lemmas detail how limits of substitutions and composition 
of substitutions relate to an mgu. 

Lemma 2.2 (Lemmas 2.2, 4.3 and 4.4 from ( [King, 2000j )) 

1. 9°° £ mgu(eqn(9)) if 6 £ RSub. 

2. 6 o 6»°° £ mgu(E U egn(0)) if 6 £ mgu(9 co (E)) . 

3. 3(dom(0) \ rng(9)).5 £ mgu(9(E)) if 6 o 9 £ mgu(E). 
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2-4 Linearity 

Variable multiplicity is defined in order to formalise linearity. The significance of 
linearity is that unification of linear terms enables sharing to be described by more 
precise sharing abstractions (even in the presence of rational trees) . 

Definition 2.2 

The variable multiplicity map x : T°°(F U U) -> {0,1,2} is defined: x(t) = 
max({x(x, t) x G U}) where x( x jt) = rnin(2, \{a \ t(a) = x}\). 

If x(t) — 0; t is ground; if = 1, t is linear; and if x(t) = 2, t is non-linear. The 
next lemma details the forms of sharing barred by the unification of linear terms. 



Lemma 2.3 (Proposition 3.1 from (King, 200L )) 

If 9 G mgu({s = £}), x 7^ y and var(9(x)) n var(6(y)) ^ then either: x G var(s) 
and y G var(t); or x,y G var(t) and x( s ) = 2; or x G var(t) and y G var(s); or 
x,y G uar(s) and x(t) = 2. 



The correctness arguments for abstract unification require lemma 2.3 to be aug- 
mented with a new result - lemma 2.4. The proof of this lemma is analogous to 
that of lemma 2.3 detailed in (King, 200C). 

Lemma 2.4 

If G mgu({s = t}) and x(@( x )) = 2 then either: x G var(s) fl var(t); or x G var(t) 
and x( s ) = 2; or a; G var(s) and x(i) = 2. 



2.5 Groundness and sharing abstractions 

The abstract domains of interest in this paper are represented either as Boolean 
functions, or as sets or as sets of sets. Let X denote a finite subset of U. The set of 
propositional formulae over X is denoted by Boolx and Y abbreviates the formula 
AY". The (bijective) map modelx ■ Boolx — > p(p(X)) is defined by modelxif) — 
{MCX\ ip x (M) h /} where ipx{M) — MA A{^y | y G X\M}. The groundness, 
sharing, freeness and linearity domains over X are defined as follows: 

Definition 2.3 

Pos x = {/ e Boolx I X h /}, Sh x = {K p(X) I G 5}, Fr x = p(X) and 
Lm x = p(^). 

If S G S/iX) then each G G 5 is referred to as a sharing group. 

These domains are connected to the concrete domain of sets of equations by 
Galois connections induced by the concretisation maps. This approach leads to 
succinct statements of correctness. To obtain well defined concretisations, maps 
abstracting substitutions are introduced. It is then observed that the abstractions 
for equivalent idempotent substitutions are the same. 

Definition 2.4 

The abstraction maps a Pos : Sub — > Posjj and a x h : Sub — > Shx are defined: 
a Pos {6) = A{x ^ var(t) x ^ t G 0}, a s x h {6) = {occ(6,u) n X | u G U} and 
occ(6,y) = {u G U I y G var(9(u))}. 
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Lemma 2.5 

Let 6,<j) £ imgu(E). Then a Pos (0) = a Pos (</>), a s x h {9) = a x h ((/)), 0(x) e 17 iff 
0(a) G E7 and x(0(x)) < 1 iff x(0(a:)) < 1- 

Proof 

By lemma |2.l| there exists {xi i— ► C 6* such that (f> — p o where /? = {x^ i— ► 

1. Let x ^ t <E 9. Observe that {x i— > p(t),yi >—> Xi, . . . ,y n i— > x„} C p o and 
j/i £ var(t) iff Xj G var(p(t)), thus a Pos (0) |= a: <-> var(p(t)) A (A™ =1 yj <-> 
arj) |= x <-» var(t). Hence a Po;i (</>) |= a Pos (0). The other direction is similar. 

2. Observe that occ(po9, yi) = occ(9, Xi), occ(po9, X{) = occ(9, yi) and occ(po9, u) 
— occ(9,u) for all u £ dom{p) Urng(p). Hence af 1 ^) = a x h {(j)). 

3. and 4. Immediate. 

□ 

Instead of defining concretisation in terms of a particular imgu (the limit of a 



rational solved form (King. 2000)), an arbitrary imgu is used. This new approach 



simplifies correctness proofs. 
Definition 2.5 

The concretisation maps lx° s • E° s x —> p{Eqri), j x ■ Shx — > p(Eqn), 
j x r '■ P r x —* p(Eqn) and 7^ m '■ Linx — > p(Eqn) are respectively defined by: 

lx° s (f) = {Ee Eqn \ 39 G imgu{E).a Pos {9) \= /} 
71^(5*) = {E G Eqn \ 39 G imgu(E).a x h (9) C S} 
7^ r (F) = {£ G Bgn | 36* G imgu{E)Mx e F. 6{x) G U} 
j x in (L) = {£e Bgn | 30 G imgu(E).Vx G L.x(9(x)) < 1} 

Each free variable is linear so that 7 Pr (i^) n7^ in (L) = 7^ r (F) f)j x in (L\JF). This 
paper is concerned with combined domains and the following combined concreti- 
sation maps will be useful: 7f p ((S, F)) = 7f h (S) n 7| r (F) and 7f pi «S, F, L)) = 
lx F ({S,F))n lx ™(L). 



A connection is established in (Codish et al, 1999) which sheds light on the 



relationship between sharing and Boolean functions. The corollary (also observed 
in the long version of ( Bagnara et al., 2000| )) explains how this can be used to 



improve precision of combined domains. 



Lemma 2.6 (Observation 4.1 and lemma 5.1 from ( Codish et al., 1999| ) ) 
{X \ G | G G a x h (9)} C model x {a Pos (9)) where 9 is idempotent. 

Corollary 2.1 

lx° s {f)^lx h (S) = i Pos (f)ni x h (trimx(f,S)) where trim x (f,S) = {G G S \ 
X\G G model x{f)}- 

Finally, the following auxiliary operations will be used throughout the paper. Let 
S,Si G Sh x - The relevance map is defined rel(t,S) = {G G S|uar(t) n G ^ 0}; 
closure is defined S* = n{S' | S C S" A VGi,G 2 G S'.Gi U G 2 G S'}; and pair- 
wise union is defined Si W S 2 = {Gi U G 2 | Gi G Si A G 2 G S 2 }. Observe that if 
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var(rel(s, S))r\var(rel(t, S)) = then var(9(s))nvar(6(t)) = for all 9 £ imgu(E) 
and E £ -f^ h (S). Thus the independence check var(rel(s, S)) fl var(rel(t, Sj) = 
can verify that two terms s and t do not share under 9 (or equivalently E). 



3 Independence check in set-sharing 

The following example demonstrates that pair-sharing can sometimes detect inde- 
pendence when standard set-sharing unification algorithms cannot. 

Example 3.1 

Let X = {u,v,w,x,y, z} and consider E £ -f^ FL ((S, F, L)) where S — {0, {u,w}, 
{v,w}, {x 7 y}, {x, z}, {w,x}}, F = and L = X. Let 9' £ imgu(EU{w — x}). The 
set-sharing unification algorithms of (Langen, 1991 ; Bagnara et at, 2000| ) give the 



following abstraction S' = {0}U(S£, WS*) for 9' where S w = {{u, w}, {v, w}, {w, x}} 
and S x = {{x, y}, {x, z}, {w, x}}. Observe that {u, v, w} £ S* and {x,y, z} G S* 
and therefore S' does not assert the independence of u and v (similarly y and 
z). However, if S is interpreted as a set of pairs, then the pair-sharing abstract 
unification algorithms of (ICodish et ai, 199 it [King, 2000|) both give the abstraction 



SU {{w}, {x}, {u, x}, {u, y}, {u, z}, {v, x}, {v, y}, {v, z}, {w, y}, {w, z}} which states 
the independence of u and v (and similarly y and z). Note that this different does 
not stem from a difference in the set-sharing and pair-sharing domains, but derives 
from the way in which linearity is exploited in the abstract unification algorithms. 

The crucial difference between pair-sharing and set-sharing algorithms is that the 
former does not require the terms in the equation to be independent to exploit lin- 
earity. Put another way, to apply linearity the latter requires that var(rel(s, S)) D 
var(rel(t, S)) = when solving the equation s = t in the context of the sharing 
abstraction 5*. Lemmas |2.3| and |2.4| detail the forms of sharing that can arise in 
mgu({s' — t'Y) rational (and finite) tree unification where s' and t' are arbitrary 
terms. Observe that s' and t' are not required to be independent. Abstract uni- 
fication algorithms with the independence check are safe. However, this check is 
not fundamental to combining sharing with linearity. By observing how to exploit 
linearity more fully a more precise abstract unification algorithm can be obtained. 
This algorithm also explains why algorithms with the independence check are safe. 
The following abstract operator is used to approximate the multiplicity map in 



abstract unification. Lemma |3.1 asserts its correctness. 
Definition 3.1 



X (t,S,L) 



if 3x S var(S).x(x, t) = 2 

if 3a; e var(S).x £ varit) \ L 

if Eke, y £ var(t)3G £ S.x ^ y A x,y £ G 

otherwise 



Lemma 3.1 

If E e lx h ( S ) n lx m ( L ) and 6 G imgu(E) then *(0(f)) < x(t, S, L). 



Three Optimisations for Sharing 



7 



Proof 

Suppose x(6(t)) = 2. One of the following holds: 

• There exists x G var(t) such that xi x >t) — 2 and var(6(x)) ^ 0. Then 
x G var(S) so that x(t, S, L) = 2. 

• There exists x G var(t) such that x(@( x )) = 2. Then a; G var(S) and 
as G uar(i) \ L so that x(t, 5, L) = 2. 

• There exist x, y G var(t) such that x ^ y and uar(#(x)) niw(0(y)) 7^ 0. Then 
there exists GeS such that x, y G G so that 5, -L) = 2. 

□ 

The revised abstract unification algorithm (with the independence check removed) 



is detailed in definition 3.2, and theorem 3.f establishes its correctness 



Definition 3.2 {Abstract unification 1) 

Abstract unification amgui((S, F, L), s, t) — (S', F' , L') is defined: 

S s =rel(s,S) S t =rel{t,S) S' = (S \ (S s U St)) U S" G' = X\var(S') 



S s a S t if seFVteF 

(s* a s t ) n (S s a st) if x (s, S, l) = x (t, s, L) = 1 

S"' = < 5!W5 t ifyfs 



5* a 5 t 
S 5 a 5 t * 
s? a 5; 



ifx(s,S,L) = l 
if X (*,5,L) = l 
otherwise 



F' = 



F 

F\var(S s ) 
F\var(St) 
F\var(S s US t ) 



if s G F A t G F 
if s G F 
if t G F 
otherwise 



L' = f'UG'U 



L \ (var(S s ) n mr(5 t )) if *(«, S,L) = l/\ x (t, S,L) = 1 

L\var(S s ) if x(s, 5, £) = 1 

L\var(S t ) if x(*, 5 L) = 1 

L \ var(S s U St) otherwise 



A precision gain over previous algorithms follows since a closure is avoided if s is 
linear but not £ (or vice versa) and s and £ are not independent. When both s and 
t are linear, but not independent, two closures are required (as previously), but 
the resulting sharing abstraction may contain fewer elements owing to the pruning 
effect of intersection. When the independence check is satisfied, that is S s D St — 0, 
it follows that (S* a St) H (S s a S t *) = S s a S t . This explains why algorithms 
with the independence check are safe. Note that if s and t are both linear, but 
not independent, an implementor might trade precision for efficiency by computing 
St a S t if 15.1 < \St\ and S s a ST otherwise. 



Theorem 3.1 ( Correctness of abstract unification 1 ) 

Let E G 7f Fi ((S, F, L)), var(s)Uvar(t) C A and amgm^S, F, L), s, t) 

Then EU{s = i}e 7l Fi ((S', F', L'j). 



(S',F',L'}. 
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Proof 

Put E' = {s = t}. Let 9 G imgu(E) and 9' G imgu(E U E'). Observe that 
unify(9(E')) 2 unify(9(E')Ueqn(9)) = unify {E 1 U eqn{9)) = unify(EUE') ^ 0. 
Thus let S G imgu{9{E')) = imgu(9 oc (E')). By part | of lemma |J, (5 o 6>°° G 
mgu(eqn(9) U £') = myw(i; U £')■ Since dom(6>) n rng(5) =0), 5o9 oc =5o9e 
imgu(El) E'). 

1. To show afP(5 o 9) C S', let y G [7 and consider occ(<5 o9,y). 

(a) Suppose y ^ rng(S o 9). 

i Suppose y g^ dom{8 o 9), that is, (5 o 0(y) = y. Thus 9{y) = y' and 
S(y') = y. Suppose y ^ y' . Then y G dom{9), thus y ^ rng{5) which is 
a contradiction. Therefore y = y' , giving 9(y) — y and S(y) = y. 

A Suppose y g^ var(9(s)) and y ^ var(9(t)). Hence y g^ dom{5) and 
y £ rng(6), so that occ{8 o 9,y) (1 X = occ(9,y) n X £ S. But 
var(s) n occ(9, y) = and similarly var(t) n occ{9, y) = 0, so that 
occ((5 o 0, y) n X G S". 

B Suppose y G var(9(s)) and y var(9(t)). Since 5(y) = y, it follows 
that y G var(<5 o 9(s)) — var(S o Suppose y G rng(S), then 

y dom(9), hence y G rng(So9) which is a contradiction. Therefore 
y rng(5), thus y G var(9(t)) which is a contradiction. 

C Suppose y g" var(9(s)) and y G var(9(t)). Analogous to the previous 
case. 

D Suppose y G var(9(s)) and y G var(9(t)). Since i5(y) = y and 
y ^ rng(8 o 9), y rng{9). Thus y G war(s) and y G var(t). Since 
y g - rng(9), it follows that y g" dom(9), therefore y ^ rng(5). Thus, 
occ(S o 9,y) = occ{9,y). Therefore occ{5 o 9,y) n X 6 5 S since 
var(s) C X and occ(<5 o 0, y) n X G St since var(t) C X. Thus 
occ((5 o 6», y) n X G S". 

ii Suppose y G dom(5o9). Since y ^ rng(8o9), occ(5o9, y) n A = G <S". 

(b) Suppose y G rng(6 ° 9) \ var(9(E')). Then y g" dom(8) and y g^ rng(S) 
so that occ(# o9,y) = occ(9,y). Moreover, since y g" var(9(E')) it follows 
that occ(<5 ofl, !/ )nleS\(S s US t )C 5'. 

(c) Suppose y e rng(5o9)nvar(9(E')). Since occ(8,y) C var(9(s))L)var(9(t)), 
occ(So9,y)nX = U{occ(9,u)nX \ u G occ(£,y)} = (Ui? s ) U (Ui? t ), where 
i? s = {occ^, u) n A | v G var(9(s)) n occ(S, y)} and i? 4 = {occ(6», iu) n A | 
w G var(9(t))r\occ(S,y)}. If i? s = 0, then y ^ uar(<5 o 0(s)) = var(So9(t)), 
hence i? t = and occ(6 o 6>, y) n X = G S'. Likewise occ(S o 9,y) X = 
G S' if -R t = 0. Thus suppose i? s ^ and i? 4 ^ 0. Since var(s) C X, 
i? s C S* s and since var(t) Q X, R t C. St- 

i Suppose s G F. Thus #(s) G [/, hence \R S \ — \var(9(s))\ — 1. Moreover 
x($( s )) < 1- Suppose \R t \ R s \ > 1- Thus there exists u ^ v such 
that it, v G var(9(t)) \var(9(s)) and war((5(w)) n uar((5(u)) ^ 0. This 
contradicts lemma |2.3| , hence |i?t \ i? s < 1. Thus occ(S o 9,y) C] X G 
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ii Suppose t £ F. Analogous to the previous case. 



iii Suppose x(s, S, L) = 1. Thus x(#(s)) < 1. As with case |l(c)i| , it follows 
that \R t \ R s \ < 1. Thus occ(c5 o 0, y) n X G S* W S t . 

iv Suppose S,L) = 1. Analogous to the previous case. 

v Otherwise occ(S o 0, y) n X £ S* W S£. 

2. It is straightforward to show <5 o 0(x) G C/ for all x £ F'. 

3. To show x(<$ o 0(x)) < 1 for all x £ L' . Observe x(& ° 0(x)) = if x £ G and 
X(S o 0(x)) = 1 if x £ F', Hence, let x £ L C X and suppose x(<5 ° = 2. 

(a) Suppose x( s jS,L) — 1. By lemma |3~l|, x(#( s )) < 1- 

i Suppose there exist it, u G var(9(x)), u ^ v such that var(S(u)) n 
uar(£(u)) 7^ 0. By lemma |2~3| either: 

Aug var(8(s)) and u G var(9(t)), hence a; G occ(9,u) D X £ S s , and 
therefore x £ L'. 

B u £ var(8(t)) and v £ var(6(s)), hence a; G occ(0,v) flIeS s , and 
therefore x G" L' . 

C u,v £ var(6(s)). Hence x £ occ(8, v) n X £ S s , and thus x $ L'. 

ii Suppose there exists u £ var(9(x)) such that x(5(ii)) = 2. By lemma [Q| , 
M G var(9(s)), thus a; G occ(6, u) f) X £ S s and therefore a; G" L'. 

(b) Suppose x(t, S, L) = 1. Analogous to the previous case. 

(c) Otherwise observe that either: 

i There exist u,v £ var(9(x)), h/d such that var(S(u))r\var(5(v)) 7^ 0. 
Thus u £ var(9(E')) and x £ occ(9, u) n X £ S s U 5 t . Hence x £ L' . 

ii There exists u £ var(6(x)) such that x(3( u )) — 2- Thus u £ var(6(E')) 
and x £ occ(9, u) D X £ S S U S t . Hence x £ L'. 

□ 

Example 3.2 

Consider again example |3.1[ Observe that amgui{(S, F, L),w,x) — {S',F',L') 
where S' = {0} U (S* W S x ) (~l (S^, l+l 5*) = {0, {«, to, x}, {w, to, x, y}, {it, w, x, z}, 
{v, to, x}, {v, w, x, y}, {v, w, x, z}, {w, x}, {w, x, y}, {to, x, z}}, F' = and V = 0. 
This asserts the independence of u and v (similarly y and z), as required. 



The following example, adapted from ( Langen, 1991 ), illustrates that closure can 



be required to abstract the unification of linear terms. 
Example 3.3 

Let X — {to, x, y, z} and observe E £ ^y^ FL ((S, F, L)) where E = {w = f(x, y, z)}, 
S = {0, {w, x}, {w, y}, {w, z}}, F — and L = {w,x,y,z}. Let E' — {w = 
f(z,x,y)} and note that 9' £ imgu(E U E') where 9' — {w t— ► f(z,z,z),x <— * 
z,y i-> z}. Thus EUE' £ ^ FL ({S', F', L')) where S' = {0, {w, x, y, z}}, F' = 
and L' = {x,y, z}. Indeed, if = rel(w,S) = x}, {w, y}, {w, z}} and 5* = 
rel(f{z,x,y),S) = {{«;, x}, {«;, y}, {w, z}} then (5J bd S t ) n (5. bd 5 t *) = {{w,x}, 
{w,y}, {w,z}, {w,x,y}, {w,x,z}, {w,y,z}, {u;,x,y,z}}, thus amgui({S,F,L), 
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w, f(z,x,y)) yields a safe, though conservative, abstraction. Closure is required 
to construct the {w, x, y, z} sharing group. 



4 Decomposition of set-sharing 



File (File, 1994) observes that different sharing and freeness abstractions can rep- 
resent the same equations, that is, 7x ((Si, F)) — 7^ F ((S , 2, F)) does not imply 
that Si = 5 2 . Therefore the relationship between Sh x Fr and the concrete do- 
main is a Galois connection rather than an insertion. An insertion is constructed 
by using F to decompose S into a set of sharing abstractions K F (S) such that each 
B G K F (S) does not include sharing groups that definitely arise from different com- 



putational paths. The following definition and lemma from ( File, 1994 ) formalises 
this decomposition, henceforth referred to as the File decomposition. 



Definition ^.1 
The map K F (S) 



Sh — > p(Sh) is defined by: 



K F (S) = { B 



B CS A F C var(B) A 
VGi, G 2 e B.{G X ± G 2 -> Gi n G 2 n F = 



Lemma J^.l 
J S x F ((S,F)) 



U{^ F ((B.F))\BeK F (S)}. 



Using the above, abstract unification can be refined to U{amgu((B, F, L), s, t)\B £ 
K F (S)}. Abstract unification computed in this way does not merge sharing groups 
arising from different computational paths, and thereby improves precision. Cal- 
culating Kp(S) is expensive and the number of calls to amgu is \Kp(S)\ (which 
is potentially exponential in \S\). However, this tactic suggests lightweight refine- 
ments to closure (*) and pair-wise union (ttJ) that recover some precision at little 
cost. Since two distinct sharing groups which contain a common free variable must 
arise from different computational paths, they cannot describe the same equation 



and therefore need not be combined. Definition 4.2 details the refined abstract 



unification algorithm and theorem 4.1 builds on lemma 4.2 to establish correctness 



Definition 4-2 (Abstract unification 2) 

Abstract unification amgu2({S, F, L),s, t) — (5', F' , L') is defined: 



S" 



( (s*^ F s t )n (s s u F s;n if x (s,s,l) ■■ 

S* F U F S t iix(s,S,L): 
S s ti^Sr it X (t,S,L)-- 
S* F tt F St F otherwise 

Si Wf S 2 = (J {Gi U G 2 |Gi e Si A G 2 e S 2 A Gi ^ G 2 
S* F =f){S' \S C S' AVGi,G 2 g S'.Gi n G 2 nF = i 



X (t,S,L) = l 

1 
1 



Gi n G 2 n f = } 
■* Gi U G 2 G S' } 



where S' , S s , S t , F' and L' are defined as in definition 3.2 
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Notice that the use of freeness is completely absorbed into * F and W f ■ The following 
lemma demonstrates that W F and * F coincide with W and * for each element of the 
File decomposition. The correctness of abstract unification (amgu 2 ) follows from 
this result. 

Lemma 4.2 

1. If B G K F (S) and RCB, then R* = R* F . 

2. If B G K F {S) andi?i,i? 2 C F, thenFiWH 2 = HiW F H 2 , R{^R 2 = R\^ F R 2 , 
Hi l±l H^ = Hi l±) F i?2 and w R 2 = R i &f R^- 
Proof 

1. Proof by induction. 

(a) Suppose F = 0. Then R* = % = R* F . 

(b) Suppose F = {G}UH. By the hypothesis, R'* = R'* F . Since RCB, then 
for all G' G R', G H G n F = 0. Hence F* = R* F . 

2. (a) To show R 1 W F 2 = Hi W F H 2 . Let G t G H ( . If Gi n G 2 n F ^ then 

Gi = G 2 . Hence G U G 2 £ Hi W F H 2 . 

(b) ToshowF^WH 2 = HJW F H 2 . Let Gi G Hf and G 2 G R 2 . Then Gi = UQi 
for some Q x C Hi. Put Y = Gi nG 2 (IF, Q[ = {G G Qi | Gn Y = 0} and 
Qi = Qi \ Q'i- Observe that \Q'{\ < 1 and Q'( C {G 2 }. Thus Gi U G 2 = 
(UQi)UG 2 . Since (UQi)nG 2 nF = it follows that G x UG 2 G R\ W F H 2 . 

(c) To show Hi W H 2 = Hi W F H|. Analogous to the previous case. 

(d) To show R\ Wi?^ = R\ W F H|. Let Gi £ Hf and G 2 G i^. Then G, ; = UQ t 
for some Q, C F t . Put Y = Gi n G 2 n F, Q\ = {G G Qi \ G n Y = 0} and 
Qi =Qi\Q[- Observe that \Q'{\ < 1. 

i Suppose \Q'{\ = or |Q 2 '| = 0. Then Gi nG 2 C\F = 0, hence Gi UG 2 £ 

H* y F H 2 . 

ii Suppose \Q'{\ = |Q' 2 '| = 1. Hence Q'/ = Q 2 ', thus GiUG 2 = GiU(UQ' 2 ). 
Since Gi n (UQ 2 ) n F = it follows that Gi U G 2 £ HJ W F R* 2 . 

□ 



Theorem 4-1 (Correctness of abstract unification 2) 

Let F G lx FL {{S, F, L)), var(s)Uvar(t) C X and amgu 2 {(S, F, L), s, t) = {S' , F' , L') 
Then £U{s = !}e 7| Fi ((5", F', L')). 

Proof 

Observe F G ^ F ((S,F)) and F G 7x(F). By lemma |DJ there exists F G F/ F (5) 



such that F e 7 f F ((F, F)), hence F G ^ FL ((B, F, L)). Observe that if s G F then 
5* F = 5 S (and likewise for t G F) and hence by lemma IO, amgU\({B, F, L),s, t) — 



amgu 2 ((B,F,L),s,t). By theorem £U{s = i}e j^ FL (amgu 1 {(B, F, L), s,t)) 
= lx FL ( am 9 u 2{{B,F,L),s,t)), thus F U {s = f} G ^ FL {amgu 2 ({S,F,L),s,t)). 
□ 



The proof explains why the standard freeness tactic is a specialised version of the 
File decomposition. 
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This refinement is only worthwhile if redundant sharing groups are introduced in 
analysis. Although it can be shown that projection and join do not introduce re- 
dundancy, the following example indicates that redundant sharing groups can arise 
in abstract unification (amgu\) and that the refined abstract unification (amgva) 
can avoid some of these redundant sharing groups. 
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Example 4-1 

Let X = {x,y, z], S = {0, {x, y}, {y, z}}, F — {y} and L = {y}. Suppose s = x 
and t = z. Then S s = {{x,?/}} and St = {{x, z}} so that amgui((S, F, L), x, z) — 
({0, {x, y, z}}, 0, 0). However S S * F — {{x, y}} and S t * F — {{x, z}} and in particular 
S S * F t±l F S t * F = so that amgu 2 ({S, F, L),x, z) = ({0}, 0, {x, y, z}). 

The following example demonstrates that amgu 2 is not as precise as the full File 
decomposition. 

Example 4-2 

Let X = {x, y, z}, S = {0, {x}, {z}, {x, y}, {y, z}}, F = {x, y, z] and L = {x, y, z}. 
Suppose s — x and t — z. Then S s — {{x}, {x, y}} and St = {{z}, {y, z}}, hence 
S S * F = S s and S t * F = S t . Thus S S * F W F S t * F = {0, {x, z}, {x, y, z}}. It follows that 
amgu2((S, F, L), x, z) = ({0, {x, z}, {x, y, z}}, F, L). However, the File decomposi- 
tion gives K F (S) = {Si, S 2 , S 3 , S^} where Si = {{x}, {y, z}}, S 2 = {0, {x}, {y, z}}, 
S-3 = {{x,y},{z}} and £4 = {0, {x, y}, {z}}. Moreover, amgui({S 2 , F, L) . x, z) = 
amgmdS^FjL)^^) = ({0, {x, y, z}}, F, L). Since Si C S 2 and S 3 C 5 4 , the File 
leads to the sharing abstraction {0, {x, y, z}}, which is more precise. 

5 Pruning of set-sharing 

Pruning sharing groups is advantageous for efficiency and precision. By reducing the 
size of an abstraction, abstract unification works on smaller objects and is therefore 
faster, even if no precision is gained. Of course, the benefit of pruning for efficiency 
needs to outweigh its cost. 



5. 1 Pruning with freeness via groundness 

Surprisingly, combined sharing and freeness information can improve groundness 
propagation and sharing even for rational tree unification. For example, the equa- 
tion x = f(y, z) can be abstracted by [x <-> z) A (x <-> y) if x and y are free variables 
that share. This is because, in this circumstance, finite tree unification fails for 
x = f(y, z) whereas rational tree unification binds x and y to /(/(. . . , z), z). Ab- 
stract unification can use the freeness of variables in the equation to extract hidden 
groundness information (for distinct computational paths) and thereby prune shar- 



ing groups and improve precision. The proof of theorem 5.1 again uses the File 
decomposition. 

Definition 5.1 [Abstract unification 3) 

Abstract unification amgu^{{S, F. L), s, t) — (S", F', L') is defined: 

f Uggs s trim x ^Y,{G}<S F S t ) if seF At <£U 
S' = (S\(S s US t )) U < [\ G£ g t trim x {Z t,S„ W F {G}) if t G F As £ U 
[ S" otherwise 

where Y = var{t) \(GF\F), Z = var(s) \ (G(~)F), S s , S t , S", F' and L 1 are defined 



as in definition 4.2 
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Theorem 5.1 (Correctness of abstract unification 3) 

LetF G 7 | Fi ((S*,F, J L)), var(s)Uvar(t) C X and amgua((S, F,L),s,t) = (S',F',L'). 
Then £U{s = !}£ 1 X FL (.( S '> F 'i L '))- 

Proof 

Suppose s G F. By lemma O there exists B G ifjr(5) such that £ G j x FL ((B 1 F)) 



and by theorem 4-l|, E'Ujs = i} G l x [ B ') where 5 ' = (5\(5 ! U5 t ))U(B 8 Wj r B 



£? s = rel(s,B) and £? t = rel{t,B). Let G imgu(E). Since s G F, 0(s) = x for 
some x G U. Furthermore, s G G for all G G S s . Since s G F, B s = {G} where 
G = occ(0, x). Observe that 9(y) = x for all y G GnF. Since t £ J7, 6»(i) ^ {/, hence 
a^ os ({6»(s) = 6»(i)}) |= s <-» y. Moreover, mgu(E U {s = i}) = mgu(eqn(8) U {s = 
t}) = mgu{eqn{6) U {0(s) = 0(t)}). Thus aJ os (F U {s = t}) |= a$ 0;5 ({6l(s) = 
9(t)}) \= s <-> y. The result follows by corollary |2.1|. The t £ F case is analogous 



and the otherwise case follows immediately from theorem 4.1. □ 



The following example illustrates the gain of precision. Note that even the File 
decomposition cannot match this level of precision. 

Example 5.1 

Let X = {x,y,z}, S = {0, {x, y}, {y}, {z}}, F = {x,y} and L = {x,y}. 
Suppose s — x and t — f(y, z). Consider the File decomposition, that is, Kp(S) — 
{S 1 ,S 2 ,S 3 ,S 4: } where Si = {{x,y}}, S 2 = {9,{x,y}}, S 3 = {{x,y},{z}}, 
5 4 = {0,{x,y},{z}}. Then amgui({S4 7 F, L),x, f(y, z)) = (S',0,0) where S' = {0, 
{x, y}, {x, y, z}}. Since Si C 5*4 for all i G {1, 2, 3}, the decomposition results in the 
sharing abstraction 5". Moreover, amgu2((S, F, L),x, f(y, z)) — (S', 0, 0). However, 
amgu 3 ((S,F,L),x,f(y,z)) = {trim x (x <-> z , 5"), 0,0) = ({0, {x, y, z}}, 0, 0) which 
is more precise. 

Example 5.2 

Let X = {x,y,z}, S = {0, {x, y}, {y, z}}, F = {y} and L = {y}. Suppose 
s = x and t = z. Since x, z G U, amgu 3 ({S, F, L), x, z) = amgu2((S, F, L),x, z) 
= ({0, {x, y, z}}, 0, 0} whereas the File decomposition produces ({0}, 0, {x, y, z}) 
(see example |4.1|). 



Example 5.2 shows that amgu 3 is not uniformly more precise than the File de- 
composition, hence is sub-optimal. Nevertheless, this pruning tactic suggests that 
any optimal abstract unification algorithm for sharing, freeness and linearity, in 
the presence of groundness, will have to consider subtle interactions between the 
components. 



5.2 Early pruning with groundness 

Sharing abstractions can always be pruned by removing sharing groups which con- 
tain ground variables. Common practice is to schedule the solving of equations so 
as to first apply abstract unification to equations on ground terms ( [Langen, 1991 ) . 



Moreover, (Muthukumar & Hcrmencgildo, 1992) details a queueing/dequeueing 
mechanism for maximally propagating groundness among systems of equations. 
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This can involve repeated searching. This section proposes a revision of this tactic 
that applies groundness to the complete set of equations (without repeated search- 
ing) and then uses the resulting groundness information to prune sharing before 
abstract unification is applied. The gain is that searching and scheduling are no 
longer required (the mechanism is single pass) and that the disjunctive groundness 
information captured by Pos can be exploited so that abstract unification can po- 
tentially operate on smaller abstractions. Observe that groundness information will 
normally be tracked by Pos anyway, thus the computational overhead is negligi- 
ble. To formulate this strategy, abstract unification is lifted to sets of equations as 
follows: 

Definition 5.2 

The map amgUi(T,E) = {T' | (T, E) «<■** (T',0)} is defined by the least relation 
~> C (Sharex x Fr x x Lin x ) 2 such that (T, {s = i) U E) ~> {amgu^T, s, t),E). 

The following theorem states correctness of the early pruning using groundness for 
amgu\, amgui and amgu^. 

Theorem 5.2 

Let E G 7 |° s (/) n lx FL ({S,F,L)), E U E> E ^ os (f), Y = {y e X \ f \= y}, 
S' =trim x (f AY,S), F' = F \var(rel(Y, S)), L' = L U Y, var(E) C X and 
T E amgu t ({S', F', L'),E'). Then E U E' E lx FL {T). 

Proof 

Let 9 G imgu(E) and 9' E imgu(E U £")■ Since 9' G unify(E), 9 < 9' and there 
exists C G Sub such that ( o 9 = 9' . Since 9' G unify(E'), ( G unify(9(E')) so that 



mgu(9(E')) ^ 0. Let 6 E imgu{9(E')) = imgu(9°°{E')). By part @ of lemma ^2 
S o 9 = 5 o9°° E mgu(eqn(9) U E') = mgu{E U £"). Thus there exists p G Rename 
such that p o 8 o 9 = 9' . Now var(9'(y)) = for all y G Y, hence var(S o 9(y)) = 
for all y £ Y. Put Z = U{var(%)) | y G Y}, </> = 3Z.<5 and V = 3Z.S. Let z G Z. 
Then there exists y G Y such that z G var(9(y)). But var(5 o 9(y)) — 0, hence 
rgn(ip) = and 5 = ip o (f). Thus ip o (f> £ mgu(9{E')) and by lemma [2.2| part [|, 
3(dom((/)) \ rng((f>)).ip G mgu((j) o 9{E')). Furthermore, 3(dom((f>) \rng(<p)).ijj = ip 
hence ■0 £ mgu{4> ° 6{E')). Since </> o is idempotent, G mgu{(4> o 9)°°(E')). By 
lemma |J, part |, 5/>o^o0 = -0o(0o 6>)°° G mgu(eqn{cj) o 9) U E'). Thus 6>' G 
imgu(eqn((/) o 9) U E'). 

To show egn(0 o 0) g -/ x h {trim(f A Y, 5)). Let u e [/. If occ(0 o 6>, u) = then 
occ(cj) o9,u) (~\X E S trivially. If occ((f> o 9, u) ^ then occ{4> o9,u) = occ(0, it) since 
rn#(</>) = 0. Thus occ(<po9, u)DX G 5. Therefore eqn(<t>o6) G 7f' l (S'). By lemma [^2 



part 0, <5o6> g mgu(EUeqn(9)). But 0' G mgu(EUeqn(9)) and therefore there exists 
p G Rename such that po5o6» = <9'. Thus a Pos (5o0) f= a Pos (,9 0(5o(9) = a Pos (0') |= 
Y. Observe that if a Pos (6 o 9) \= u then a Pos (<?(> o 9) ^ u hence a Pos (0 o 0) |= Y. 
Since a Pos (0 o 9) \= a Pos {9) \= f, it follows that a Pos (</> o 9) \= f A Y. Therefore 
egn(0 o 0) G 7| os (/ A Y). By corollary O, egn(0 o 0) G -f x h {trim(f A Y, 



To show (/> o 9(x) G {/ for all x G F'. Let x E F and a; ^ uar(re/(Y, S")). Since 
x g" var(rel(Y, S)), x E' occ(9, u) D X or y ^ occ(9, u) D X for all it G U and y EY. 
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Since x € X and Y C X, var{6{x)) n var(6(y)) = for all y e F. Hence 0(x) Z, 
thus 6{x) £ dom(<p), therefore <p o 0(x) E f7. Thus eqn(<p o 6) G 7f r (F'). 

To show x(</> ° 0(x)) < 1 for all .t e L'. Since rng(4>) = 0, x(0 ° 0(z)) < 1 for all 
x € L. Moreover, a Pos (<j>o 8) |= Y" and therefore x(<^°0(aO) < 1 for all x €Y. Thus 



eqn(tpo 9) G 7^ ln (L'). The result then follows by induction on i5 and theorems 3.1 



4.1 and 5.1. □ 



The following example illustrates the computational advantages of early pruning. 
Example 5.3 

Let X = {u, v, x, y}, S = {0, {x}, {y}, {u}, {v}}, F = 0, L = and / = x V y. Let 
E' = {x = f(u, v),x = y} so that /' = (x\/y)/\(x <-> (uAv))A(x <-> y) = xAyhuAv. 
Then Y = {x,y,u,v} so that f AY = xAyAuAv and S' = trim x (f AY, S) = {0}. 
Hence amgu 3 {(S, F, L),E') reduces to amgu 3 ((S' ', F, L),E') = ({0}, 0, 0). Without 
this tactic, no equation of E' will possess a ground argument and both calls to 
arngu^ will involve non-trivial sharing group manipulation. 



6 Conclusion 

This paper has given correctness proofs for sharing analysis with freeness and lin- 
earity which hold in the presence of rational trees. The abstract unification algo- 
rithms are themselves novel - incorporating optimisations for both precision and 
efficiency. Specifically, the independence check which can prevent linearity from be- 
ing exploited has been removed. In addition, refined closure and pair-wise union 
operations have been derived from the File decomposition. A further precision opti- 
misation has been presented which exploits an interaction between sharing, freeness 
and groundness, which shows the subtlety that an optimal algorithm will need to 
address. These optimisations have been chosen to balance precision against effi- 
ciency whilst not changing the underlying representation of the abstract domains. 
They are ordered according to their anticipated degree of usefulness. This work 
provides the implementor with a suite of new optimisations for abstract unification 
algorithms for sharing, freeness and linearity. 
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